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Analyze application-specific 
HTML DOM and define 
application-specific grammar 



Create yacc source file 
containing defined grammar 
together with C++ statements 
that generate user interface 
(UI) objects 
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Run yacc with source file to 
generate C++ source code file 
for the parser executable 
program 
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Compile yacc output file with 
additional C++ code modules, 
e.g., scanner code module 
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Resulting program accepts 
HTML DOM and produces a 
list of UI objects 
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FIGURE 3 
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Access HTML DOM in 
web browser and serialize 
the tree of DOM elements 
by traversing DOM 
hierarchy 



Create one or more tokens 
for each DOM element 



Parse tokenized input 
according to application- 
specific grammar 



Generate list of UI objects 
that correspond to images 
displayed in the browser 
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Input file containing 
HTML Grammar and 
C++ code outputting UI 
object descriptors 
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